智能论文笔记

Component Segmentation of Engineering Drawings Using Graph Convolutional Networks

Wentai Zhang , Joe Joseph , Yue Yin , Liuyue Xie , Tomotake Furuhata , Soji Yamakawa , Kenji Shimada , Levent Burak Kara

分类：计算机视觉 | 机器学习

2022-12-01

We present a data-driven framework to automate the vectorization and machine interpretation of 2D engineering part drawings. In industrial settings, most manufacturing engineers still rely on manual reads to identify the topological and manufacturing requirements from drawings submitted by designers. The interpretation process is laborious and time-consuming, which severely inhibits the efficiency of part quotation and manufacturing tasks. While recent advances in image-based computer vision methods have demonstrated great potential in interpreting natural images through semantic segmentation approaches, the application of such methods in parsing engineering technical drawings into semantically accurate components remains a significant challenge. The severe pixel sparsity in engineering drawings also restricts the effective featurization of image-based data-driven methods. To overcome these challenges, we propose a deep learning based framework that predicts the semantic type of each vectorized component. Taking a raster image as input, we vectorize all components through thinning, stroke tracing, and cubic bezier fitting. Then a graph of such components is generated based on the connectivity between the components. Finally, a graph convolutional neural network is trained on this graph data to identify the semantic type of each component. We test our framework in the context of semantic segmentation of text, dimension and, contour components in engineering drawings. Results show that our method yields the best performance compared to recent image, and graph-based segmentation methods.

translated by 谷歌翻译

对称性是本质上的无所话话，并且由许多物种的视觉系统感知，因为它有助于检测我们环境中的生态重要的物体类。对称感知需要抽象图像区域之间的非局部空间依赖性，并且其底层的神经机制仍然难以捉摸。在本文中，我们评估了深度神经网络（DNN）架构关于从示例学习对称感知的任务。我们证明了在对象识别任务上建模人类性能的前馈DNN，不能获取对称的一般概念。即使当DNN被重建以捕获非局部空间依赖项，例如通过`扩张的“卷曲和最近引入的”变压器“设计，也是如此。相比之下，我们发现经常性架构能够通过将非局部空间依赖性分解成一系列本地操作来学习对称性，这对于新颖的图像来说是可重复使用的。这些结果表明，经常性联系可能在人工系统中对称性感知中发挥重要作用，也可能是生物学的。

translated by 谷歌翻译

在许多实际控制应用中，由于植物特征的变化，闭环系统的性能水平随着时间而变化。因此，在不经过系统建模过程的情况下，非常需要重新设计控制器，这对于闭环系统通常很难。强化学习（RL）是一种有前途的方法之一，仅基于闭环系统的测量，可以为非线性动力学系统提供最佳控制器的无模型重新设计。但是，RL的学习过程需要使用可能会在植物上累积磨损的控制系统不良的系统进行大量试验实验。为了克服这一限制，我们提出了一种无模型的两步设计方法，该方法在未知非线性系统的最佳调节器重新设计问题中提高了RL的瞬态学习性能。具体而言，我们首先设计了一种线性控制定律，该法律以无模型的方式达到一定程度的控制性能，然后通过并行使用设计的线性控制法来训练非线性最佳控制法。我们引入了一种线性控制定律设计的离线RL算法，并理论上保证了其在轻度假设下与LQR控制器的收敛性。数值模拟表明，所提出的方法可以提高RL的超参数调整中的瞬态学习性能和效率。

translated by 谷歌翻译

对象识别和视点估计位于视觉理解的核心。最近的作品表明，卷积神经网络（CNNS）未能概括为分发（OOD）类别 - 观点组合，即。在训练期间没有看到的组合。在本文中，我们通过评估培训的CNNS来调查何时以及如何以及如何以及如何进行分类对象类别和3D观点，并识别有助于这种ood泛化的神经机制。我们表明，增加了分布式组合的数量（即数据分集），即使具有相同数量的培训数据，也会大大提高了oo ood组合的概率。我们比较学习类别和观点在单独的和共享网络架构中，并观察分销和ood组合的显着不同趋势，即。虽然共享网络有用的分配有用，但单独的网络在ood组合中显着优于共享的网络。最后，我们证明，通过专业化的神经机制，即。两种类型神经元的出现 - 神经元选择类别和不变性观点，反之亦然。

translated by 谷歌翻译